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We present the first uniform treatment of long duration gamma-ray burst 
(GRB) host galaxy detections and upper limits over the redshift range 3 < 2 : < 5, 
a key epoch for observational and theoretical efforts to understand the processes, 
environments, and consequences of early cosmic star formation. We contribute 
deep imaging observations of 13 GRB positions yielding the discovery of eight 
new host galaxies. We use this dataset in tandem with previously published 
observations of 31 further GRB positions to estimate or constrain the host galaxy 
rest-frame ultraviolet (UV; A = I 6 OOA) absolute magnitudes Muv- We then 
use the combined set of 44 Muv estimates and limits to construct the Muv 
luminosity function (LF) for GRB host galaxies over 3 < < 5 and compare 

it to expectations from Lyman break galaxy (LBG) photometric surveys with 
the Hubble Space Telescope. Adopting standard prescriptions for the luminosity 
dependence of galaxy dust obscuration (and hence, total star formation rate), 
we hud that our LF is compatible with LBG observations over a factor of 
600x in host luminosity, from Muv = —22.5mag to > —15.6mag, and with 
extrapolations of the assumed Schechter-type LF well beyond this range. We 
review proposed astrophysical and observational biases for our sample, and hud 
they are for the most part minimal. We therefore conclude, as the simplest 
interpretation of our results, that GRBs successfully trace UV metrics of cosmic 
star formation over the range 3 < z < 5. Our hndings suggest GRBs are providing 
an accurate picture of star formation processes from z zz 3 out to the highest 
redshifts. 

Subject headings: gamma-ray burst: general — galaxies: high-redshift — galax¬ 
ies: luminosity function — galaxies: star formation — cosmology: observations 


1. Introduction 

A fundamental goal of modern cosmology is to understand the history of star formation 
in our Universe, from its earliest epochs to the present day. Efforts over recent decades, 
including ambitious ground-based spectroscopic surveys and space-based observations in the 
ultraviolet (GALEX), optical to near-infrared (Hubble Space Telescope), and longer wave¬ 
lengths (Spitzer, Herschel) have enabled multiple determinations and cross-checks, including 


* Partly based on observations collected at the European Organisation for Astronomical Research in the 
Southern Hemisphere under IDs 089.A-0120(A) and 091.A-0786(A). 
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extinction corrections, of the cosmic star formation rate (SFR) at redshifts ^ ^ 3, and the 
hrst credible estimates over 3 ^ ^ ^ 8; for a recent review and references, see IMadau & 


Dickinson (2014). 


Star formation in the early Universe, at z > 3, is of particular interest as it tracks the 
formation of the first galaxies, which should account for the redshift and timescale of the 
z ^ 6 cosmic reionization and may yield insight into the dark matter-driven formation of the 


hrst cosmic structures (e.g.. 

Yoshida et al. 

2(1(13 

Trent! et al. 

2010 

Tseliakhovich & Hirata 

2010 

Dijkstra & Wyithe 2012; 

Jaacks et al. 

2012 

Trent! et al. 

2013). Accurately quantify- 


ing star formation at these redshifts presents obvious challenges: at these distances even the 
most luminous galaxies are faint and nearly inaccessible to spectroscopic study; moreover, 
high-sensitivity observations are currently (until the advent of the James Webb Space Tele¬ 
scope, JWST) restricted to rest-frame ultraviolet (UV) and optical bands. Nonetheless, the 
underlying scientific promise has motivated intense observational efforts. 


Multiple deep surveys with the Hubble Space Telescope (HST) have collected over 10,000 
photometrically-selected Lyman break galaxies (LBGs) over 3 ^ 2 : ^ 8 ( Bouwens et al.||2011[ 
Oesch et al.| 2012 Ellis et ah 20131 Schenker et al.| 2013 Schmidt et ah 2014t Bouwens 


et al. 2014b, 2015b), with multiband imaging over substantial regions allowing the fitting of 


full spectral energy distributions (SEDs), and thus stellar population models, for individual 
galaxies (e.g., Duncan et al.||2014 ). These surveys have been complemented by HST observa¬ 
tions of the most massive known galaxy clusters, which take advantage of the gravitational- 


lensing boost to extend z > 6 LBG luminosity functions (LFs) by a full magnitude (Atek 


et al. 2015), and have yielded LBG candidates out to z ~ 9.6 (Zheng et al. 2012 Bouwens 


et al. 2014a). 


Meanwhile via ground-based facilities, narrow-band surveys targeting emission-line galax¬ 


ies have collected substantial samples over 5 < z < 7 (Shimasaku et al. 2006; Ouchi et al. 


2010 Ota et al. 2010); cosmic microwave background (GMB) experiments have discovered 


dozens of gravitationally-lensed, dusty, star-forming galaxies at 2 ; > 4 (Vieira et al. 2010 


Mocanu et al. 

2013 

Marsden et al. 

2014 

); and large-area near-infrared (NIR) surveys are 


yielding competitive constraints on the bright end of the z ^ 7 galaxy LF (Bowler et al. 


2014) and pushing the frontier of quasar discovery out to > 7 (Mortlock et al. 2011[). 


With the latest results from the Planck Gollaboration (2015) yielding a reduced estimate 


for the Thomson scattering optical depth to the GMB (tcmb = 0.066 ± 0.012) consistent 
with reionization ending at z ^ 6 (Shull et al. 2012 Robertson et al. 2015 Bouwens et al. 


2015a), the chief remaining uncertainty in our estimates of high-redshift SFR must be the 


quantity of star formation happening in faint galaxies, beyond the reach of even the deepest 
HST helds. Such galaxies are too faint for current galaxy surveys; at higher redshifts, many 
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have flux densities fi, ^ InJy that will challenge even JWST, and thus most conceivable 
observational tests. 

In this respect, long-duration gamma-ray bursts (GRBs) offer an elegant and comple¬ 


mentary approach, currently reaching to z 8 (Tanvir et ah 2009 Salvaterra et ah 2009 


Gucchiara et al. 1120 11 

), that could be extended to z 12 with relatively modest space-based 

observatories ( 

Burrows et al. 2010; Paul et al. 

2011; 

Greiner et al. 

2012 

. Due to their link 

to core-collapse supernovae (SNe) (Stanek et al. 

2003 

; Hjorth et al. 

2003 

), GRBs probe the 


formation of massive stars. Intrinsically luminous and dust-penetrating, GRBs accurately 
pinpoint regions of active SF - in three dimensions - independent of galaxy luminosity, dust 
obscuration, and precise redshift. 

Past efforts to use GRBs as tracers of cosmic star formation (SF) have compared the 
GRB redshift distribution to other independent estimators. Since SF at low redshifts is 
considered well-characterized, the aim of these studies has been to validate or calibrate the 
GRB distribution over 2 ; ^ 3 so as to leverage knowledge of the GRB redshift distribution 
at higher redshifts, 3 ^ ^ 8, to infer cosmic SFR and explore the consequences for cosmic 

reionization. Initial efforts using the Swift (Gehrels et al. 2004) GRB redshift distribution 


found that the GRB rate rose more rapidly with redshift over z ^ A than other estimators 


Yiiksel et al. 

2008 

[Kistler et al. 

2009 


0.6 < a < 1.8, led to ^ > 4 SFR predictions consistent with extrapolations of the LBG LF 


at those redshifts (Kistler et al. 2009). 


Wanderman & Piran (2010) also found a steep rise in the GRB rate at 2 ; ^ 3 and 


a relatively slow decay at z > 4, which they suggested was inconsistent with LBG SFR 
estimates. However, their derived z > 4 SFR (Fig. 9) closely resembles both the evolution- 
corrected GRB-based SFR and the “LF integrated” LBG-based SFR from Kistler et al. 


(2009, their Fig. 4). Given the uncertainties, it may be accurate to characterize these initial 


papers as demonstrating that the z > A SFRs implied by GRB rates and LBG surveys are 
consistent over a range of plausible LBG LF extrapolations and GRB evolution corrections. 

This consistency of GRB- and LBG-derived SFRs would be in accord with 
Robertson & Ellis ( |2012 ), who found that varying their treatment of the unknown redshifts 
of dark bursts (GRBs that lack bright optical afterglows, and are hence missing from redshift 
samples based on afterglow spectroscopy) had a significant impact on the strength of the 
evolution correction needed to bring GRB and other SFR metrics into agreement at z < A: 
putting all dark bursts at their maximum likely redshifts suggested a mild GRB evolution 
correction, a ~ 0.5; while putting all dark bursts at low redshifts favored no correction, 
a 0. Along these lines, we note that luminosity-based selection of GRBs, pursued in some 
cases to improve sample completeness, has been shown to affect the strength of the inferred 
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anti-metallicity bias ([Elliott et al. 2012) 


If differential evolution of the GRB rate and SFR is required at low-redshift, there is 
a broad consensus that this would reflect an anti-metallicity bias in the GRB population, 


thanks to past suggestions on both theoretical (e.g., Woosley & Heger 2006 Yoon et ah 


2006) and observational (e.g., Stanek et al. 2006 Fruchter et al. 2006) grounds. However, 
since the average metallicity of star-forming galaxies drops precipitously from z = 0 to z ~ 3 


(Maiolino et al. 2008), even if anti-metallicity bias is present the implied correction from 
GRB rates to SFR at 2 ; 3 might be modest (Trenti et al. 2013). Indeed, comparison 

of high-resolution cosmology simulations to the observed GRB sample suggests a ~ 0 at 


redshifts oi z > h (Elliott et al. 2015). 


In this paper, we seek to make a different type of global comparison between GRBs 
and UV-based star formation metrics, calculating the UV luminosity function of GRB host 
galaxies over 3 < ^ < 5 and comparing it to prior expectations from LBG surveys (see 


also Schulze et al. 2015). We focus our efforts on this redshift range because: (1) The 
accuracy and completeness of current SFR estimates at these redshifts, in terms of extinction 
corrections and the necessary extrapolation to unobserved faint galaxies, remain subject to 
active debate; (2) At these redshifts any metallicity bias in the selection of GRB host galaxies 
is likely to be reduced or, potentially, negligible; (3) A sufficient number of GRB redshifts 
have been measured within this range to make statistical analyses useful; and (4) Large- 
aperture ground-based telescopes (and HST) can readily detect host galaxies of modest 
luminosity at these redshifts or, alternatively, yield useful constraints on their magnitudes. 
Naturally, we carry out this work in hopes that it will shed light on the connection between 
GRBs and SF not just within, but also beyond, our targeted redshift range. 


In Section below we present 15 observations of 13 GRB host galaxies over 3 < z < 5, 
which yield detections of eight and deep limits of hve GRB hosts. We supplement these data 
with 61 observations of 31 additional host galaxies from the literature, yielding detections 
of 21 and limits of 10 hosts. With a uniform treatment of the new and previously published 
data, informed by the results of the HST LBG surveys, we convert each observation into a 
rest-frame UV (A = I 6 OOA) absolute magnitude Muv or limit, using a AGDM cosmology with 
Ho=70 km/s/Mpc, r2M=0.3, r2A=0.7. In Sectionj^we construct the UV LF for the GRB host 
galaxies, and compare it to the SFR-weighted LF of Lyman-break galaxies in this redshift 
range. We then discuss the possible influence of proposed astrophysical biases, including an 
anti-metallicity bias in GRB production and suppression of star-formation in low-mass halos, 
and observational selection effects, including interloper galaxies, “dark bursts” potentially 
missing from our sample, and publication bias. Our conclusions are presented in Section 
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2. Observations 


Presently, measured redshifts for GRB afterglows are known for more than 400 GRB^ 
among those 65 (as of February 2015) in the redshift range 3 < z < 5. Of those, host 
measurements for 31 are reported in the literature and in the GRB Host Studies (GHostS) 
databas^ at widely disparate sensitivities. We add our own data, in particular FORS2/VLT 


(Appenzeller & Rupprecht 1992) observations of 7 GRB hosts, and GROND/2.2m (Greiner 


et al. 2007, 2008) observations for another 6 GRB hosts. 


Our VLT observations of GRB host galaxies at 3 < z < 5 are summarized in Table 
FORS2/R-band observations are taken as a sequence of 1140 s integrations, while FORS2//- 
band observations are taken as a sequence of 240 s integrations, owing to the brighter sky 
in that bandpass. Observations were carried out in service mode, extending over two or 
more nights in each case, and were almost all of high quality. The raw data are available for 
downloading from the ESO VLT data archiv^ Pointings were chosen to put the targeted 
region of sky in a clean portion of GGD 2 on FORS2, southwest of the pointing center. 


GROND observations (also listed in Table were taken as a sequence of 369 s exposures 
using four or six telescope dither positions. Imaging was done in all seven GROND hlters 
simultaneously; for each target we report the observed magnitude for the bluest hlter redward 
of, and not affected by, Ly-a. 


2.1. Data Reduction 


After bias-subtraction and flat-helding, VLT images are trimmed to exclude the vi¬ 
gnetted corners of the FORS2 held of view; GGD 1 images are trimmed to a 3.'53 by 2.'01 
rectangular region, and GGD 2 images to a 3.'53 by 1.'50 rectangular region. 


Individual GGD images are processed with SExtractor ( Bertin fc Arnouts|1996 ) in order 
to identify and mask bright objects and ht and subtract a smoothly-varying model of the 
sky background. We observe mild fringing at a level of <1% of the sky background which 
is not reliably repeated from one observing epoch to the next. For this reason, we construct 
an object-masked fringe image for each observing epoch and scale and subtract it from the 
images before proceeding. Using the native WGS, the two GGD images are then integrated 


Uee the online collection at http://mpe.mpg.de/$\sim$jcg/grbgen.html 
^GHostS: http: //www. grbhosts . org/ 

WLT data archive: http://dataportal.eso.org 
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into a single image with an 18-pixel (2728) gap between the two helds. 


Coaddition of the multiple frames for each held proceeds via a two-step process. In the 
hrst step, images are aligned via cross-correlation analysis and a median stack is constructed. 
This median image is used to identify bad pixels and cosmic ray-affected pixels in each 
individual frame, updating the previous (static) bad pixel masks for each detector. Masked 
pixels in each image are replaced by corresponding values from the median image, and the 
images are re-aligned with a second cross-correlation analysis. The hnal coadded image is 
then generated as the sigma-clipped mean of the masked and aligned individual frames. The 
coadded image is trimmed to the size of the best-centered individual frame, yielding a single 
3.'53 by 3.'55 image with roughly constant depth across the held. We then rehne the WCS 
of this hnal image by reference to the SDSS (when available) or USNO-Bl.O catalogs; the 
resulting WCS solutions have uniformly acceptable residuals of <072 (rms). 


These residuals are due to the mapping against the catalog positions. Astrometry 
between two GROND images (for afterglow and host, respectively) or between GROND 
and FORS2 is even more accurate. In order to determine the position of the afterglow 
relative to the host galaxy, the following procedure is used. First, a source detection is run 
over the GROND images containing the afterglow, providing a list of x,y coordinates of the 
afterglow and neighbouring held stars (extended objects were de-selected). This list is then 
used as “catalog” input list for the astrometry of the FORS2 images. This provides relative 
accuracies of <0705 and <071 (rms) for GROND-internal and GROND-FORS2 mapping, 
respectively. In Fig. [T]the position of the afterglow is plotted with a cross, while the circle 
denotes the aperture used to extract the host photometry. 


Photometric analysis is done using standard IRAF tasks (Tody 1993), as documented 
in full in Kriihler et ah (2008). In short, bright stars within the held are used to ht a 
point-spread function (PSF) which then is applied to the entire held. Absolute photometric 
calibration of the FORS images was achieved with GROND by observing a Sloan digital Sky 
Survey (SDSS) held ( Aihara et aH20lf| that was closest to the host galaxy, and consecutively 
the corresponding host held (in few cases this was not needed as the GRB lies already in 
SDSS-covered area). 


With accurate afterglow positions registered on our VLT and GROND frames, and using 
the well-known <1" ohsets of long-duration GRBs from their hosts (Bloom et ah 2002), we 
are hnally in the position to search for a host related to the corresponding GRB afterglow. 
Simple aperture photometry is used for the host galaxy. For each galaxy a curve-of-growth 
analysis was carried out to determine the aperture size for which the signal-to-noise ratio 
is optimized (also added as separate column in Table [^. We also list the centroid 
positions of the host galaxies in Table A selection of stars for each host held were 














then used to determine the associated zero point. 


2.2. Results and Source Notes 


Fig. [T] shows the hnal image of each of our sample GRBs. Table [T] contains the details of 
the observations, including the GRB name (column 1), the redshift (2), telescope/instrument 
(3), hlter (4), exposure (5), centroid position of the host (0"3 uncertainty) (6), the 
aperture size as determined from the curve-of-growth analysis (7) and the host 
magnitude with uncertainty, reporting a 2cr upper limit for non-detections (8). The three 
GRBs with explicit errors in the redshift column have photometric redshifts. 

With respect to the data reduction for individual GRB positions, we have the following 
notes: 


GRB 090313: The galaxy 2!'5 North of the GRB afterglow position is not the host 
galaxy. Using Le PHARli[^ ([Arnouts et ah 


1999; Ilbert et ah 2006), the spectral energy 


distribution obtained with GROND over all seven optical-|-NIR channels is best £t with an 
Sbc spiral galaxy template at a photometric redshift of 2;phot = l.OiO.l. This is much smaller 
than the GRB afterglow redshift, and also lower than the redshifts of the two intervening 


absorber systems reported by de Ugarte Postigo et al. (2009). The limiting magnitude of 


our GROND image is about IPS deeper than reported in Table but the halo of this nearby 
galaxy produces additional background light at the afterglow position. 

GRB 120805A: Based on the combination of GROND afterglow photometry and X- 
Shooter host spectroscopy, Kriihler et al. (2015) derive a photometric redshift 2 ; ~ 3.1. 


Table presents further data on these observed GRBs, along with our compilation 
of GRB host measurements from the literature. Data columns present the GRB name and 
redshift (columns 1 and 2), the host magnitude or upper limit in the measured hlter band (3), 


the Galactic foreground extinction from Schlahy & Finkbeiner (2011) (4), and the distance 
modulus for our chosen cosmology (5); the derived value of the spectral slope [3 (6), absolute 
UV magnitude Muv (7), and inferred star formation rate (8); and appropriate references 
(9). 


^LePHARE website: http: //www. cfht .hawaii . edu/$~\sim$arnouts/LEPHARE 
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Right Ascension (J2000) 

Fig. 1.— Finding charts of our new GRB host observations; the GRB name, instrument 
and filter band are labeled in each chart. North is up, and East to the left. The circle is 
centered on the host (if detected) or on the afterglow position, and denotes the aperture 
used to extract the photometry. The cross indicates the afterglow position. 

2.3. Host Galaxy Properties 

We derive host galaxy properties (Table from the observed magnitudes and upper 
limits as follows: 


1. We correct observed magnitudes for the effects of Galactic extinction, applying the 


extinction law of Gardelli et al. (1989) with Ry = 3.1, using the dust maps of Schlafly 


& Finkbeiner (2011) to estimate E{B — V) for each line of sight; 


2. We calculate the absolute magnitude at the observed rest-frame wavelength by applying 
the distance modulus for our chosen cosmology (ifo=70 km/s/Mpc, Gm=0.3, Ga=0.7); 

3. We solve jointly for the absolute magnitude of the host galaxy at the rest-frame wave¬ 
length A = lOOOA, Muv, and the spectral index of its continuum, {3 {f\ oc A^), using 
the relation for star-forming galaxies at {z) = 3.8 from Table 3 of Bouwens et al. 


(2014b): 


/5 =-1.85-0.11(Muv +19.5). (1) 

Note that while this relation exhibits substantial scatter, uncertainties for 
the fitted parameters over ensembles of high-redshift star-forming galaxies 
are modest, /3_i9.5 = —1.85(1)(6) and d/^/dMuy = —0.11(1). 
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Our procedure for upper limits is as follows: If our upper limit implies Muv > 
— 18.1 mag (strong constraint) then we adopt the value appropriate for a galaxy with 
absolute magnitude equal to our upper limit. If our upper limit is at Muv < —18.1 mag 
(weak constraint) then we adopt f3 = —2 exactly. Note that for the latter observations, 
our quoted Muv limits and our observed upper limits (at rest-frame wavelength Aobs) 
will be the same. 


4. In a handful of cases (GRBs 000131, 060223A, 060605, 081029, 090323, 090519, 100518A, 
130408A) observations of host galaxies at z > 3.5 are attempted in the i?-band or simi¬ 
lar filters, which extend over the Lyman-alpha feature at these redshifts. We correct for 
absorption blueward of the Lyman-alpha transition by integrating the adopted power- 
law continuum over the hlter bandpass, assuming a uniform 60% suppression of the 
host galaxy continuum at wavelengths blueward of Lyman-alpha. This ~1 mag of UV 
continuum suppression is typical for high-redshift LBGs (e.g., [Ando et ah' 2004). The 
corresponding Muv values are marked with an asterisk in Table 

5. We estimate the host galaxy star formation rate from the calculated spectral slope /3 
and absolute magnitude Muv by assuming that the UV extinction Aieoo = 4.43 mag -|- 
1.99/? (Meurer et al.||1999), and that the total star formation rate is proportional to the 


extinction-corrected UV luminosity (Eq. 11 of Duncan et ah 2014). For consistency, 
we quote estimates from each single-band observation, even for hosts with observations 
in multiple filters or with previously-published SFR estimates. Extinction values are 
required to be non-negative; only the two faintest host limits are affected by this 
restriction. 


We note that since observed spectral indices of high-redshift star-forming galaxies are 
close to /? ~ —2 ( Bouwens et al.[ 2014b), their restframe UV spectra are nearly flat in 
with AB magnitudes that are close to constant redward of the Lyman-alpha transition. 
Further, we note that Duncan et ah (2014) found the adopted star-formation estimator 
satisfactorily reproduced the results of full spectral energy distribution analyses (to within 
<0.1 mag) for high-redshift galaxies in GANDELS GOODS South. On the other hand, we 
recognize that the SFR estimates for galaxies detected in multiple bands are not always 
consistent; GRB 110818A represents an extreme case, with Muv and SFR estimates from R 
and /-band observations that differ by more than 3 ct; the spectral slope between the two 
observations is /? = -1-0.28, very different than the /? ~ —1.6 expected on the basis of the 
galaxy’s absolute magnitude (although this relation is observed to have significant scatter, 
e.g. Fig. 7 of Bouwens et al.|[2014b[ ). 


Among the six targets with photometric redshift (or similarly low accuracy) estimates, 
the error on the derived absolute magnitude across the likely redshift range is less than 
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0.3 mag for all but two. For the two exceptions, GRBs 100518A and 000131, this is due to 
the uncertain impact of Lyman-alpha absorption in i?-band observations. Since GRB 000131 
has an /-band limit which we use for our host studies, only the single limit for GRB 100518A 
(one of 29 detections and 15 limits) is affected in a signihcant way by photometric redshift 
uncertainty. 


As a consequence of the observed correlations between Muv, /d, and Aigoo specihed 
above, we hnd that (except where the condition of non-negative extinction applies) Aieoo = 
0.75 — 0.22(Muv + 19.5) and hence Muv,int = 1.22 Muv + constant. Thus we expect the 
intrinsic UV luminosities of the host galaxies to scale as Tuv,int cx: Tyv for bright galaxies, 
Afuv < —16.1 mag. In the complementary regime, Muv > —16.1 mag, the expected extinc¬ 
tion is zero, and intrinsic and observed UV luminosities will be equivalent. Hence, since SFR 
is proportional to Luv,int, we adopt the above SFR weighting to convert from the observed 
luminosity function of LBG galaxies in the held to predicted luminosity functions for GRB 
host galaxies (see |:3.1). 


2.4. Comparison to the Literature 

In general, the absolute magnitudes derived here from previously published magnitudes 
and limits are in good agreement with those already published, in particular for hlters located 
close to rest-frame 1600A (corresponding to Aobs = 6400A at z = 3 and Aobs = 9600A at 
= 5). For longer-wavelength hlters, the ehect of the luminosity-dependent slope relation 
that we have adopted introduces corrections which range from 0.2 to 0.4 mag. 

Three GRBs from our sample have published star-formation rates: 


GRB 971214: compared to our SFR of 9.0T1.3 MqJt Kulkarni et ah (1998) report 
5.2 MqYt~^ unobscured SFR, and Perley et ah (2013) report M©yr“^. 


GRB 080607: compared to our SFR of 2.1±1.0 MqJt ^ Perley et ah (2013) report 
19 . 1 +^-^ 


^- 4.9 Mq yr 


-1 


GRB090323: compared to our SFR of 26.3±3.9 M© yr Savaglio et ah (2012) estimate 


6.4 M©yr ^ obscured SFR from the same GROND r'-band imaging. 


The difference for GRBs 971214 and 080607 is likely due to the different 
method used, i.e. the SED fitting employed by Perley et al. (2013) For GRB 
090323, the difference is nearly solely (factor 3.0 out of the total factor 4.1 
difference) due to the /9-slope dependent extinction-correction which we (but 
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not Savaglio et al. 2012) have applied; the two approaches are consistent to 
within a factor of two if the same method is used. The case of GRB 090323 also 
demonstrates the relatively small residual effect (remaining difference of ;^50%) 
of even a relatively extreme spectral slope (—1.6) as derived via our approach. 


3. Analysis &: Interpretation 

We now proceed to analyze onr dataset of GRB host galaxy properties in light of cur¬ 
rent models of galaxy formation and GRB production, addressing astrophysical biases and 
possible selection effects. 


3.1. Host Galaxy Luminosity Function 


First, we use our collection of 3 < 2 ; < 5 GRB host galaxy luminosities and upper limits 
to estimate the GRB host galaxy luminosity function and compare to recent survey work on 
z > ?> galaxies using deep- and wide-held imaging observations from HST. For each burst we 
choose the single best estimate or upper limit from Table preferring detections over lim¬ 
its, and higher-precision measurements (e.g., HST data) over lower-precision measurements, 
and selecting the shortest-wavelength hlter redward of Lyman-alpha to minimize systematic 
uncertainty. 


With the resulting set of 29 Muv measurements and 15 lower bounds, we use Kaplan- 
Meier estimation ( Kaplan fc Meierjl958 ), as implemented in the R statistical software package 
( Therne^|2012 R Gore Team||2013 ), to construct the maximum-likelihood LF for GRB host 
galaxies over 3 < 2 ; < 5, along with its 50%-conhdence and 90%-conhdence ranges. Our 
results are presented in Fig. (upper panel), along with a histogram of the input detections 
and limits (lower panel). As common in astronomical applications of the Kaplan-Meier 
algorithm, the requirement that the “survival” of any individual data point be unrelated to 
its true value is not strictly met, since the intrinsically-faintest galaxies are less likely to be 
detected. However, as also common in astronomical situations, the broad range in source 
redshifts and the range in achieved limits across the various observations serve to randomize 
the distribution of detections and limits somewhat. As such, we are conhdent that the 
resulting characterization of the LF is roughly as good as any non-parametric analysis of the 


data can offer (see Feigelson & Babu 2012 for discussion). 


(2015b 


For comparison purposes, we use the {z) = 3.8 luminosity function of Bouwens et al. 
the “All Fields” fit from Table 6). This luminosity function results from their 
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Fig. 2.— Luminosity function (LF) for GRB host galaxies over 3 < z < 5 and comparison to the Lyman- 


break galaxy (LBG) LF at (z) = 3.8 (Bouwens et al. 

2015b 1 and expectations from semi-analytic models 

incorporating anti-metallicity bias in GRB production 

(Trenti et al. 

2015 

). Top panel: Cumulative Muv 


LF for 3 < z < 5 GRB host galaxies, as derived from our detections and upper limits using Kaplan-Meier 
estimation. The maximum-likelihood luminosity function is plotted as a blue line, with the 50%-confidence 
region shaded in grey and 90%-conhdence upper and lower bounds plotted as black lines with points. It is 


compared to the cumulative star formation rate (SFR)-weighted LF of (z) = 3.8 LBGs from Bouwens et al. 


(2015bI (magenta line), and to three predicted z = 3.75 GRB host galaxy luminosity functions 


from Trenti et al. (2015| (red lines). The strength of anti-metallicity bias in GRB production 
is characterized by the p parameter: p = 0 implies no GRB production at metallicities greater 
than ^0, while p = 0.2 (1.0) implies suppression by a factor of six (two) by comparison to the 
GRB production rate at metallicities less than 1Q~^Zq. The GRB host galaxy LF is consistent 
with the unbiased SFR-weighted LBG LF, borderline-consistent (at 90%-confidence) with the 
p = 1 model, and inconsistent at >90% confidence with the two more biased {p = 0.2 and p = 0) 
models. Bottom panel: Histogram of 3 < z < 5 GRB host galaxy Muv absolute magnitudes and lower 
limits, as presented in Tablej^ along with the SFR-weighted LBG LF at (z) = 3.8 (magenta line) and GRB 
host galaxy LFs from [^enti et al. (20151. 
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3-3 ^ z ^ 4.5 photometric redshift selection of Lyman-break galaxies (LBGs) over the 
XDF, HUDF09-Ps, CANDELS (North + South), and ERS HST survey helds, and reaches 
to Muv = —16 mag. A Schechter-function form ( Schechtei]|1976 ) is assumed, and the best-fit 
parameters with uncertainties are: M* = —20.88(8) mag, 0* = 1.97(31) x 10“^ Mpc“^, and 
a = — 1.64(4)[ii| In this context, we note that the median and mean redshifts of our sample 
are 3.57 and 3.72, respectively. 

To compare the LEG and GRB host galaxy luminosity functions, we must account for 
GRB selection effects. Our default hypothesis, consistent with the collapsar model, is to 
weight galaxies according to their instantaneous star-formation rates. Here we use results 
established previously ((:2.3), showing that the star formation rate scales as SFR oc Tyv 


for Muv < —16.1 mag, and as SFR oc Luv for Muv > —16.1 mag. The effect of an SFR- 
weighted selection of galaxies on the luminosity function is thus to increase the a parameter 
by 1.22 (1.00) to ospr = —0.42 (—0.64) over these two regimes, without altering M*. Note 
that (p* is irrelevant in the present context as our sample is fixed at 77 = 44 galaxies. 


Overplotting the cumulative SFR-weighted LBG LF from Bouwens et ah (2015b) 


ma¬ 


genta line) against our cumulative LF for the GRB host galaxies (blue line) shows that the 
two are consistent across the full range of absolute magnitudes probed (Fig. |^. In particu¬ 
lar, the LBG LF stays within the 90%-confidence bounds (black lines with points) 
on the GRB host galaxy LF for 38 of 42 plotted points (exceeding the constraint 
for four points at Muv < —20.5 mag), and lies within the 50%-confidence inter¬ 
val (shaded region) for 33% of this range. Exploring the degree of agreement 
quantitatively, we find that the LBG LF is consistent with our observations at 
roughly the Icr (68%-confidence) level. 

Given consistency with this straightforward model, we do not find evidence for complex 
or second-order selection effects for GRB host galaxies in this redshift range. Effects that 
have been proposed in this context include metallicity-dependent selection (biasing toward 
lower-mass galaxies; Fruchter et al.|20^; Modjaz et al.[2008; Wang &: Daij2014) and selection 


against the most luminous, dusty hosts (via extinction of the optical afterglow light; Per ley 


et ah 2009; Kriihler et ah 2011 Perley et al. 2013). By contrast, our results are consistent 


with predictions that metallicity or anti-extinction effects in the GRB population will yield 
minimal selection bias for GRB host galaxies at z > 3 (Kocevski et al. 2009 Mannucci 


et al. 

2011 

Perley et al. 

2013 


We explore the question of anti-metallicity bias 


I We note that the best-fit LBG LF parameters changed between initial preprints of 


Bouwens 


et al. (2015b) and the published version, owing to an improved treatment of the survey selection 


volume; the original submitted version of the present manuscript used the initial parameter 
values. 










































17 


quantitatively in Sect. 3.2 below, comparing our host galaxy LF to expectations 
under a range of scenarios (their “plateau parameter” p = 0, 0.2 and 1.0) for 


anti-metallicity bias explored by Trent! et al. (2015). These models are plotted 
as red lines in Fig. 

We note that our results provide independent evidence for host luminosity-dependent 
obscuration as observed both at z ~ 3 (Meurer et al. 1999) and in higher-redshift LEG 


studies (Bouwens et al. 2014b Duncan et al. 2014). In particular, if we assume that SFR is 


proportional to observed UV luminosity Luv across the full range of galaxy luminosities, the 
resulting LF is inconsistent with our GRB host galaxy LF at >90%-con£dence, lying to the 
right of the illustrated 90%-conhdence interval for 35 of 42 plotted points (and between 


the plotted LFs for the p = 0.2 and p = 1.0 models from Trent! et al. 2015) 


Comparison to recent work Since submission of this manuscript, a separate 
study of GRB host galaxy luminosity functions over 0 < z < 4.5, including com¬ 
parison to expectations from galaxy surveys and semianalytic models, has been 


posted to the ArXiv (Schulze et al. 2015). These authors study the 69 GRB 


positions defined by the TOUGH sample (Hjorth et al. 2012), a subset of which 
are included in our compilation. Since the authors do not perform a Kaplan- 
Meier analysis, the effects of host galaxy nondetections (limits) on their LFs are 
not clear, and they restrict their analysis to the depth of the faintest detected 
host galaxy in each redshift interval, foregoing the opportunity to test their LFs 
from the faint end. Nonetheless, they compare their resulting empirical LFs to 


LBG LFs and the semi-analytic LFs from Trent! et al. (2015), as we have (see in 
particular their Fig. 7). 


Schulze et al. (2015) find evidence for significant anti-metallicity bias over 
t) < z < 1 (n = 13 galaxies) and 3.0 < 2 ; < 4.5 (n = 9 galaxies). Their empirical LF 
over 1.0 < < 1.5 (n = 7 galaxies) is consistent with all tested models, while the LF 

over 1.9 < 2 ; < 2.7 (n = 22 galaxies) disfavors models with strong anti-metallicity 
bias (p = 0 and p = 0.04). Over the two higher-redshift intervals, their results are 
subject to significant uncertainty dne to the unknown redshifts of nine of their 
targeted GRBs, six of which have detected host galaxies. Including a significant 
number of these host galaxies in the 1.9 < ^ < 2.7 redshift interval would shift the 
LF for this interval to fainter magnitudes, increasing agreement with moderate 
and strongly biased models, while including a significant number in the 3.0 < 
2 ; < 4.5 interval (owing to the greater implied host galaxy luminosities at these 
redshifts) would shift the LF for this interval to brighter magnitudes, increasing 
agreement with the less-biased models (including simple SFR-weighting or no 
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bias, p —)■ oo). 

Given the uncertainties, distinct statistical approach, and relatively small 
sample size within the redshift range considered here, we do not consider the 
results of Schulze et al. (2015) to be in significant tension with ours. To the con¬ 
trary, we think they demonstrate (along with our present work) that GRB host 
galaxy studies have advanced to the point where LF comparisons over a broad 
range of redshifts are providing useful insights into the host galaxy population, 
and likely, the GRB mechanism itself. 


3.2. Astrophysical Effects 


The consistency we have demonstrated between the GRB host galaxy LF over 3 < ^ < 5 
and the most straightforward predictive model - the SFR-weighted LF of (z) = 3.8 LBGs 
from Bonwens et al. (2015b) - snggests that astrophysical biases in selecting GRB host 
galaxies at these redshifts are either minor or connterbalanced by competing astrophysical 
or selection effects. 


Of conrse, it may be the case that these astrophysical biases are, indeed, present and 
connterbalanced. Even if not, reviewing past and cnrrent predictions as to the natnre and 
strength of these effects, and comparing to the constraints implied by onr data, may help 
to place GRBs and their host galaxies in their fnll cosmic context. We therefore review 
these qnestions here, considering first proposed astrophysical biases and then the most likely 
observational or selection effects. 


Metallicity bias: The effects of host galaxy metallicity on GRB prodnction have been 


mnch debated. From the standpoint of the collapsar model of long-dnration GRBs (Woosley 


1993 MacFadyen fc Woosley|1999 ), the metal content of a massive star’s envelope is a strong 


driver of mass and angnlar momentnm losses. Expnlsion of the hydrogen envelope before core 
collapse is necessary to rednce the star’s size to the point where the GRB jet can snccessfnlly 
escape the star while the central engine is active. At the same time, if angnlar momentnm 
losses from the envelope are transmitted to the core, they will prevent a long-lived accretion 
disk from forming, drastically redncing the lifetime of the central engine. As a result, isolated 


high-metallicity stars are not expected to explode as GRBs (Woosley & Heger 2006 Yoon 


et al.||2006). The strength of this bias could be mitigated, however, if binary stellar evolution 


can produce GRB progenitors more or less independent of metallicity (Fryer & Heger 2005 


Gantiello et al. 2007). 


From an observational perspective, a bias toward low-metallicity environments finds 
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support from the lowest-redshift “low luminosity” GRBs, which have been found exclusively 


in low-metallicity host galaxies and regions of those galaxies (Stanek et ah 2006 Levesque 


et ah 2011); from metallicity studies of GRB host galaxies at z ^ 1 (Levesque et ah 2010a 


Graham & Fruchter 2013) and beyond (Kriihler et ah|2015); from comparative host galaxy 


studies of GRBs, type Ibc, and type II supernovae at z ^ 2 (Fruchter et ah 


et ah 2008); and from GRB host galaxy photometric surveys at z ^ 2 (Perley et ah 2013) 


2006 


Modjaz 


and beyond (Fynbo et aL||2003; Hjorth et ah||2012). However, contrary indications also exist: 


for example, not all GRB host galaxies are low-mass or metal poor (Savaglio et ah 2009 


Levesque et ah|2010b [Savaglio et ah|2012 Elliott et aL|2013 Schady et ah|2015 Kriihler et ah 


2015), and gas-phase metallicities derived from GRB absorption spectroscopy are consistent 


with average galaxy metallicities for star-forming galaxies at similar redshifts (Prochaska 


et ah||2007 Gucchiara et ah||2015 ). Moreover, with respect to low-luminosity GRBs it should 
be noted that these are not, strictly speaking, GRBs - their much lower luminosities can 
be accommodated within mildly-relativistic shock-breakout models, without the need for 


highly-relativistic jets (Nakar & Sari 2012 Katz et ah 2012 Barniol Duran et ah 2015). 


In an attempt to shed light on these issues, Trenti et ah (2015) extended their semi 


analytic models of galaxy formation to incorporate metallicity-sensitive GRB production 
rates. They explored a range of models, from zero to near-maximal sensitivity, by including 
metallicity-sensitive (MS) and metallicity-insensitive (MI) channels of GRB production in 
a variable ratio. The redshift distribution and host galaxy properties of the GRBs in each 
scenario are then compared to observations. Importantly, with these models they address, 
for the first time and in realistic fashion, the effect that the evolving metallicity distribution 
of star-forming regions, in galaxies of various sizes, will have on the metallicities, masses, 
and luminosities of GRB host galaxies across the full range of observed redshifts. 


Trenti et ah 

(2015 

) find that the Swift GRB redshift distribution ( 

Wanderman & Piran 


2010) is best reproduced by models that incorporate contributions from both MS and MI 


production channels. In their maximum-likelihood scenario, most GRBs at z result from 
the MI channel and hence reflect the typical metallicities of star-forming regions; GRBs are 
produced at comparable rates in the two channels at z ^ 3 and show the strongest differential 
preference (compared to total star-formation metrics) for low-metallicity regions; and GRBs 
are produced primarily via the MS channel at z 5, when nearly all star-formation is 
occurring in metal-poor environments. 


We have compared our host galaxy LF to the LF predictions of Trenti et ah (2015) at 


= 3.75 (Fig. 1^. Our LF is consistent with their predictions for the pure-MI scenario and 
inconsistent at >90%-con£dence with the pure-MS scenario (with their “plateau” variable 
p = 0), as well as with two mixed scenarios (p = 0.04, not plotted, and p = 0.2) that incor- 
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porate both MS and MI GRB production. It is marginally consistent (not excluded 
at >90%-confidence) with a further p = 1 mixed scenario that was generated 
specifically for this work. The plateau variable sets the relative efficiency of GRB 
production via the MI channel at metallicities Z > Zo- GRB production is suppressed by 
a factor of (1 + p)/p (e.g., by 6x for p = 0.2, and by 2x for p = 1) for Z > Zq, compared 
to its rate at Z < 10“^ Zq. In a pure-MS scenario, p = 0 and there is no GRB production 
at Z > Zq, while as p —)■ oo GRB production is independent of metallicity. In models 
with hnite p, the GRB production efficiency at intermediate metallicities 10’ ^ < Z/Zq < 1 


decreases monotonically with increasing metallicity; see Trenti et ah (2015) for details 


Gonsistency with the pure-MI model is expected, as it is constructed in part on the LBG 


survey work of Bouwens et ah (2015b) which provides our successful default LBG LF (^3.1). 


The incompatibility of the p = 0.2 mixed model with our observed LF is more surprising, 
as it successfully reproduces the GRB redshift distribution. Yet our sample includes a 
significantly greater proportion of bright host galaxies than this model predicts, and is 
barely consistent with the significantly reduced bias of the p = 1 mixed model. 
We conclude that a dominant role for the MS channel in GRB production at 3 < z < 5 can 
only be supported if some distinct astrophysical or selection effect is acting to enrich our 
sample with bright host galaxies. 


Suppression of star formation in low-mass halos: In fact, the possible existence and 
likely implications of one such astrophysical effect have been central to several earlier studies 
relating GRB production to high-redshift star formation (Trenti et al. 2012, 2013). These 
models assume that SF at z > 3 is efficient in dark matter halos only above a certain mini¬ 
mum mass, leading to a Schechter-like LF up to a corresponding maximum (faintest) Muv, 
beyond which the LF cuts off sharply. This limiting “suppression magnitude” Msupp has 
been taken to be —12 mag in the so-called standard scenario, corresponding to a minimum 
halo mass of 10 ® Mq. A signihcantly larger minimum halo mass for successful star forma¬ 
tion would bias the LF to brighter magnitudes and increase the fraction of detected host 
galaxies at a given flux limit. Hence, the allowed range for Mgupp can be constrained using 
observations of host galaxies at substantially brighter magnitudes, Muv ^ M^^pp. 


Such an analysis has previously been applied by 

Trenti et ah 

(2012 

) to the deep limits 

for six z > 5 GRB host galaxies reported by 

Tanvir et ah 

(2012 

); they showed that the 


luminosity function at those redshifts must extend to at least Msupp > —15 mag (90%- 
conhdence limit). With respect to the present dataset and our lower-redshift range, 3 < 
z < 5, the non-detection of two host galaxies down to Muv > —15.52 mag (GRB060607A 
aX z = 3.075) and Muv > —15.72 mag (GRB 020124 at z = 3.198) provides a strong lower 
bound, Msupp > —15.52 mag, that is almost as constraining as the previous z > 5 result. 
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Because a non-negligible fraction of SF happens in faint galaxies for any Schechter-like 
LF, values of Mgupp that are consistent with our lower (bright-end) bound can still alter 
the predicted GRB host galaxy LF in a significant fashion. For example, the p = 0.2 LF 
for z = 3.75 galaxies from Trenti et al. (2015) is consistent with (not ruled out at >90%- 
conhdence by) our results for Msnpp < —15.6 mag. While this particular case is excluded 
(in a distinct manner) by our faintest limits, plateau parameter values 0.2 < p < 1, 
excluded at >90%-confidence for M^upp = —12 mag, would be consistent with our 
data for values of Msnpp > —15.5 mag which are allowed by our deepest limits. 


3.3. Potential Observational Biases 


We now turn to consider potential biases in the observations, independent of astrophys- 
ical effects, which may affect our results. We have identified three such potential biases, and 
address each of these in turn. 


Interloper galaxies: Some of the identified galaxies, while coincident with the GRB 
positions, may not actually be the GRB host galaxies; we refer to this as the interloper 
problem. When interlopers are present, the sample will be contaminated with galaxies 
having greater inferred luminosity than the (unobserved) true host galaxies. The nature of 
this effect, then, is to bias the observed LF to brighter magnitudes. 


We estimate the magnitude of this effect by reference to R-band galaxy number counts 
from Boutsia et ah (2014). We find that for a 1" radius localization, there is a 1% chance for 
an interloper galaxy with Rab magnitude within 0.25 mag of Rab = 25 mag; a 2% chance for 
an interloper galaxy within 0.25 mag of 26 mag; a 5% chance for an interloper galaxy within 
0.25 mag of 27 mag; and a 14% chance for an interloper galaxy within 0.25 mag of 28 mag. 
By its nature, the interloper effect is not relevant for (cannot affect) our upper limits. 


Our faintest detection is = 28.07(20) mag for the candidate host galaxy of 

GRB 060605; however, this galaxy has a brighter R-band detection from the ground at R = 
26.4(3) mag. The next two faintest detections are mnow = 27.96(15) mag for GRB060223A 
and R = 27.76(45) mag for GRB 050908. These two detections should be compared to the 
~12% chance of an interloper of similar magnitude; on a purely probabilistic basis there is a 
78% chance that both are true associations and a 22% chance that one or both are false. Our 
brighter detections are less likely to be interlopers; the estimated total number of interlopers 
across our full sample of 29 detected host galaxies is 0.85. 


If one or more of our detected galaxies are interlopers, identifying and correcting for 
this would convert one, or several, of the fainter detections (e.g., those identified above). 
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into upper limits. Given the sample size and the estimated number of interlopers, we do not 
expect this to have a dramatic impact on our derived LF. 

Moreover, we note that we have calculated interloper probabilities from a frequentist 
perspective, without taking into account the prior expectation that a significant fraction of 
GRB positions should have star-forming galaxies that will be detected via the reported obser¬ 
vations. A full Bayesian analysis, beyond the scope of the present paper, would be expected 
to yield a smaller contamination rate than we have calculated above, with a correspondingly 
weaker effect on the derived LF. 


Dark hurst host galaxies: Since we require a spectroscopic or photometric redshift to 
include a GRB in our sample, GRBs that experience high extinction in their host galaxies 
are more likely to be excluded. The known hosts of confirmed high-extinction [Ay > 1 mag) 
GRBs, almost all at z < 3, have been shown to be statistically more massive, more lumi¬ 
nous, and more evolved than the host galaxies of low-extinction bursts in this redshift range 
( Kriihler et al.||2011 Perley et al.||2013 ). LBG surveys also uniformly find that more massive 
and luminous galaxies are subject to greater amounts of dust obscuration than less massive 
galaxies (Duncan et ah 2014 Bouwens et ah 2015b). Hence, selection against optically- 


extinguished bursts may exclude some of the most luminous GRB host galaxies, and bias 
our derived LF toward fainter luminosities. In the present context, with an empirical 
LF that lies on the higher-luminosity side of the SFR-weighted LBG LF and 
all semi-analytic LFs incorporating effects of anti-metallicity bias (Fig. [^, de¬ 
tailed consideration of this correction may appear unwarranted. Nonetheless, 
our concern with all astrophysical and selection effects leads us to review this 
issue briefly here. 


The most ambitious survey of high-extinction host galaxies to date (Perley et ah 2013) 


- which hrst clarihed the extent to which z <Z hosts of high-extinction bursts are systemati¬ 
cally more massive and luminous than those of low-extinction bursts - shows a clear decrease 
in this trend with redshift (e.g., in their Fig. 7), with the distributions of high-extinction and 
low-extinction host galaxies becoming consistent with each other and with the distribution 
for star-forming galaxies by z pe 2 (see caption to their Fig. 8). Unless this trend reverses 
at higher redshift, this suggests that the host galaxies of high-extinction and low-extinction 
bursts will not be strongly distinguished over our redshift range of 3 < z < 5. Furthermore, 
the host galaxies of heavily dust-extinguished GRBs are not exclusively massive, with 50% 
of the sample included in Perley et ah (2013) having stellar masses M* ^ 10^° Mq, and less 


than 20% having M* 10^^ Mq. 


We attempt to quantify the number of host galaxies missing from our sample due to 
high afterglow extinction (defined as Ay > 1 mag in the host galaxy) by considering two 
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GRB samples that are relatively complete in terms of both redshift and Ay measurements. 


These are the GROND 4-hour sample (Greiner et ah 2011), consisting of 39 long-duration 


GRBs observed by GROND within 4 hours of the Swift trigger, and the BAT peak flux- 


selected sample (Salvaterra et ah 2012 Govino et ah 2013), made up of 58 GRBs, 11 of 


which are in the GROND 4-hour sample. Removing duplicates, the union of these datasets 
provides a sample of 86 GRBs, 78 of which have redshift measurements (91% completeness). 
Ten GRBs (13%) have 3 < z < 5, and one of these has Ay > Imag (GRB 080607 with 


Ay = 1.3mag; Perley et ah 2011). This gives us a lower limit of 10% on the fraction of 


GRBs at 3 < < 5 with Ay > 1 mag. If we assume that all eight bursts without measured 

redshifts are high-extinction bursts, with redshifts distributed similarly as for the bursts with 
measured redshifts, then one of these high-extinction bursts with unknown redshifts (and 
two or fewer, at 90%-conhdence) are from 3 < 2 ; < 5. This would imply an expected fraction 
of 18% of GRBs (<25%, at 90%-conhdence) over 3 < 2 : < 5 with Ay > Imag. 


Since two high-extinction bursts are included in our sample (GRBs 080607, Perley et ah 


2013 and GRB 060210, Genko et ah 2009), a high-extinction fraction of 18% would mean 


that we have missed ~7 high-extinction afterglows in our sample of 44 GRBs. If, as we 
expect, the host properties of high-extinction bursts in our redshift range are not strongly 
distinguished from those of low-extinction bursts, then the absence of these host galaxies 
from our sample will not signihcantly affect our results. Alternatively, if the hosts of high- 
extinction and low-extinction bursts are strongly distinguished over 3 < z < 5, then our LF 
has been biased to lower luminosities by this effect, and any correction wonld shift our 
LF toward even higher luminosities. 


Publication bias: The majority of our host measurements are drawn from the published 
literature, including a diverse array of efforts and facilities. This is advantageous in the 
sense that it provides coverage of many GRB positions, yields detections of multiple bright 
host galaxies, and provides a somewhat-randomized distribution of detections and upper 
limits, helping to make a Kaplan-Meier approach to LF construction more appropriate. On 
the other hand, it means the sample may be affected by publication bias: if upper limits 
from moderate-depth observations are considered less interesting, they may be less likely 
to be reported in the literature than detections. As a result, our sample could be depleted 
in moderate-depth upper limits by comparison to a hypothetical survey offering complete 
coverage to a uniform depth; absence of these upper limits would bias our derived LF toward 
brighter magnitudes. 


While we cannot guarantee that this effect is absent in our sample, we have taken care 
to search the Gemini, VLT, and HST archives for unpublished late-time imaging of GRB 
positions within our targeted redshift range; this process led (in part) to the present set of 
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coauthors, and to our presentation of five previously-unpublished HST observations from 
Svennson (2011). Moreover, our sample includes significant contributions from the pre¬ 


defined and complete survey efforts of Hjorth et al. (2012) and Perley et al. (2013). We thus 
expect the impact of future observations of GRBs within the targeted redshift range, with 
respect to this effect, to be modest. 


4. Conclusions 


We have presented deep ground-based imaging of thirteen GRBs over 3 < 2 ; < 5 (Fig.[^, 
yielding discovery of eight new GRB host galaxies along with five deep upper limits (Table [^. 
Gombining these results with published observations of 31 additional GRB positions, includ¬ 
ing 21 detected host galaxies, we have presented a comprehensive and uniform summary of 
the photometric properties of GRB host galaxies over 3 < 2; < 5 (Table [^. 


Adopting ultraviolet (UV) continuum spectral index (/3) and extinction (Aigoo) prescrip¬ 
tions for high-redshift star-forming galaxies from multiband Lyman-break galaxy surveys 


with the Hubble Space Telescope (HST; Meurer et al. 1999 Bouwens et al. 2014b Duncan 


et al. 2014, [Bouwens et al. 2015b[ ) , we have used our detections and limits to estimate host 


galaxy absolute magnitudes at A = I 6 OOA in the rest frame, Muv (quoted in AB mags), and 
their resulting inferred star formation rates. Adopting the single most useful detection or 
upper limit for each of 44 targeted host galaxies, we have constructed the luminosity func¬ 
tion (LF) for GRB host galaxies over 3 < z < 5 by Kaplan-Meier estimation, calculating the 
-likelihood LF as well as its 50%-confidence and 90%-confidence intervals (Fig. [^. 


maximum- 


We have compared the GRB LF to expectations from LBG surveys by constructing 
the SFR-weighted LF for (z) = 3.8 LBGs (Bouwens et al. 2015b). 

= 1.97(31) X lO-^Mpc-A 


This LBG LF has 


M* = —20.88(8) mag, 0* = 1.97(31) x 10“^ Mpc“^, and a = —1.64(4); our adopted 
SFR-weighting, appropriate if GRBs trace star formation, adjusts the power-law slope by 
Aa = -1-1.22 (-1-1.00) over Muv < —16.1 mag (Muv > —16.1 mag) owing to the luminosity- 
dependent extinction observed in LBG surveys ( Meurer et aLjl999 Duncan et al.j20l4 ). The 
resulting LF is compatible with our GRB host galaxy LF over the full range of host galaxy 
luminosities probed, from Muv = —22.5 mag to Muv > —15.6 mag, a range of more than 
600 X in host luminosity. Since GRB host galaxies are selected independent of host luminos¬ 
ity, our results demonstrate consistency of the two LFs - with respect to their Schechter- 
function form and M* and a parameters - well beyond the range of absolute magnitudes 
that has been probed directly. 


We have reviewed proposed astrophysical and observational effects that might bias our 
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dataset and resulting LF. If two or more of these effects are present in sufficient and counter¬ 
balancing strength, they might conspire to yield a misleading agreement between the GRB 
host galaxy LF and the SFR-weighted LF of the LEG population, which is meant to reflect 
UV metrics of star formation over this redshift range. 


In this context, we reviewed the issue of the proposed anti-metallicity bias of GRBs. 
Existing observational evidence of this bias has been gathered almost entirely at low-redshift, 
z ^3, via studies of GRB host galaxies (Levesque||2014 and references therein), and by com¬ 


parison of the redshift evolution of the GRB rate to other metrics of star formation (Kistler 


et ah 2008, 2009 Wanderman & Piran 2010; Robertson & Ellis 2012). At 2 ; 3 a much 


greater fraction of star formation occurs in low-metallicity environments, where the effects 
of such a bias might be reduced. Nonetheless, recent theoretical explorations demonstrate 
that, for plausible models of the bias, the GRB host galaxy LF can be signihcantly altered 
over the redshift range considered here (Trenti et ah 2015). We tested these models against 
our observed LF and found (in the absence of other biases) no evidence for a dominant 
contribution from any metallicity-sensitive GRB production channel over 3 < z < 5. 


Alternatively, suppression of star formation in low-mass halos could result in a brighter 


GRB host galaxy LF than predicted via naive extrapolation of a Schechter-type LF (Trenti 


et ah 2010, 2013). We do not hnd it necessary to invoke this effect; the two faintest host 


galaxy limits in our sample, Muv > —15.72 mag and Muv > —15.52 mag, imply directly 
that Msupp —15.5 mag, and are almost as constraining as those derived from deep HST 
observations of known z > 6 GRB positions (Tanvir et ah 2012 Trenti et ah 2012). It 
remains possible that this effect is present and compensating for the effects of a (relatively 
mild) anti-metallicity bias in the GRB host galaxy population. 


In terms of observational or selection effects, we have reviewed our sample for possible 
contamination by line-of-sight interloper galaxies, unrelated to the GRB, and concluded that 
there is some chance that one or a few of our faintest detections are spurious in this sense; we 
estimated that on average, 0.85 of the host galaxies in the present sample will be interlopers. 


The observational challenges of measuring spectroscopic or photometric redshifts for 
high-extinction {Ay > 1 mag in the host galaxy) GRB afterglows are real and likely to affect 
our sample; we estimate that we are missing GRBs as a result of these effects. If the 
properties of the host galaxies of these “dark bursts” are signihcantly different than those 
of the Ay < 1 mag population, this could affect our LF and conclusions. However, the most 
extensive existing survey of the host galaxies of high-extinction bursts has shown that the 
differences between the high-extinction and low-extinction host galaxy populations decreases 
with redshift, becoming quite modest hy z ^ 2 (Perley et al. 2013). 
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We have also considered whether our sample might be depleted in medium-sensitivity 
upper limits on GRB host galaxies owing to publication bias. Given our review of the 
Gemini, VLT, and HST archives, we consider this possibility unlikely, although it is difficult 
to rule out completely. 


Rather than invoke a hypothetical and delicate balance among two or more of these 
competing astrophysical and observational effects, we conclude, as the simplest interpretation 
of our results, that GRBs accurately trace UV metrics of cosmic star formation over ?> < z < 
5. As differential effects between GRB production and star formation are robustly expected 
to decrease with redshift (Trenti et ah 2015), our hnding strongly suggests that GRBs are 
providing an accurate picture of star formation processes from 2 ; ~ 3 out to the highest 
redshifts, z ^ 5. GRBs can thus be relied upon in these regimes to provide an independent 
check on SFR estimates from galaxy surveys, which must necessarily extrapolate their results 
to faint galaxies well beyond their detection thresholds. 


Looking ahead, this work can be strengthened in a straightforward manner by carrying 
out a complete survey of GRB host galaxies over 3 < z < 5. Such a survey would address the 
question of publication bias directly, and given that the present sample already yields inter¬ 
esting constraints on GRB production biases and star formation models, would be assured 
of providing even greater insights. 


Similar work at higher redshifts, 2 ; > 5, is also desirable. Given the challenges of 
gathering useful data at these redshifts from ground-based facilities, this survey work will 
most likely be carried out using HST. 


Meanwhile, detected GRB host galaxies in this redshift range can be subjected to deep 
multiband observations, which will lead to galaxy-by-galaxy dust extinction and stellar pop¬ 
ulation models. Targeted spectroscopic investigations will also be useful to provide a detailed 
picture of the environments and properties of the star formation occurring within this unique 
and - as we have shown - SFR-weighted selection of high-redshift star-forming galaxies. 


The authors acknowledge astrostatistical consulting support from E. Feigelson. DBF ex¬ 
presses appreciation to MPE Directors K. Nandra and R. Bender for administrative support 
of his sabbatical stay at MPE, where this collaboration was initiated. PS and MT acknowl¬ 
edge support through the Sofja Kovalevskaja Award to P. Schady from the Alexander von 
Humboldt Foundation Germany. RP acknowledges co-funding through NSF grant No. AST 
1414246 and HST-GO-13831.011-A. SK acknowledges support by DFG grant K1 766/16-1 
and SSc support by the Thiiringer Ministerium fiir Bildung, Wissenschaft und Kultur under 
FKZ 12010-514. SSa acknowledges support from the Bundesministerium fiir Wirtschaft and 
Technologie through DLR (Deutsches Zentrum fiir Luft- und Raumfahrt e.V.) FKZ 50 OR 





1211. KV acknowledges support by DFG grant SA 2001/2-1, and CD acknowledges support 
through EXTraS, funded from the European Union’s Seventh Framework Programme for re¬ 
search, technological development and demonstration under grant agreement no 607452. AC 
is grateful for support of his visit to MPE Carching. Partial funding for CROND (hardware 
and personnel) was generously granted from the Leibniz-Prize to Prof. C. Hasinger (DFC 
grant HA 1850/28-1). 


REFERENCES 

Aihara, H. et al. 2011, ApJS, 193, 29 
Andersen, M. I. et al. 2000, A&A, 364, L54 

Ando, M., Ohta, K., Iwata, I., Watanabe, C., Tamura, N., Akiyama, M., & Aoki, K. 2004, 
ApJ, 610, 635 

Appenzeller, I. & Rupprecht, C. 1992, The Messenger, 67, 18 

Arnouts, S., Cristiani, S., Moscardini, L., Matarrese, S., Lucchin, F., Fontana, A., & Cial- 
longo, E. 1999, MNRAS, 310, 540 

Atek, H. et al. 2015, ApJ, 800, 18 

Barniol Duran, R., Nakar, E., Piran, T., & Sari, R. 2015, MNRAS, 448, 417 

Berger, E. et al. 2002, ApJ, 581, 981 

Berlin, E. & Arnouts, S. 1996, A&AS, 117, 393 

Bloom, J. S., Kulkarni, S. R., & Djorgovski, S. C. 2002, AJ, 123, 1111 

Boutsia, K. et al. 2014, A&A, 563, A142 

Bouwens, R. J. et al. 2014a, ApJ, 795, 126 

— 2011, Nature, 469, 504 

Bouwens, R. J., Illingworth, C. D., Oesch, P. A., Caruana, J., Holwerda, B., Smit, R., & 
Wilkins, S. 2015a, ArXiv.org, 1503.08228 

Bouwens, R. J. et al. 2014b, ApJ, 793, 115 


2015b, ApJ, 803, 34 



Bowler, R. A. A. et al. 2014, MNRAS, 440, 2810 

Burrows, D. N., Roming, P. W. A., Fox, D. B., Herter, T. L., Falcone, A., Bilen, S., Nousek, 
J. A., & Kennea, J. A. 2010, in Society of Photo-Optical Instrumentation Engineers 
(SPIE) Conference Series, Vol. 7732, Society of Photo-Optical Instrnmentation Engi¬ 
neers (SPIE) Conference Series, 1 

Cantiello, M., Yoon, S.-C., Langer, N., & Livio, M. 2007, A&A, 465, L29 

Cardelli, J. A., Clayton, G. C., & Mathis, J. S. 1989, ApJ, 345, 245 

Cenko, S. B. et al. 2009, ApJ, 693, 1484 

Chary, R., Berger, E., & Cowie, L. 2007, ApJ, 671, 272 

Chen, H.-W. et al. 2009, ApJ, 691, 152 

Covino, S. et al. 2013, MNRAS, 432, 1231 

Cucchiara, A., Fnmagalli, M., Rafelski, M., Kocevski, D., Prochaska, J. X., Cooke, R. J., & 
Becker, G. D. 2015, ApJ, 804, 51 

Cncchiara, A. et al. 2011, ApJ, 736, 7 

D’Avanzo, P. et al. 2010, A&A, 522, A20 

de Ugarte Postigo, A. et al. 2009, CRB Coordinates Network, 9015, 1 
Dijkstra, M. & Wyithe, J. S. B. 2012, MNRAS, 419, 3181 
Dnncan, K. et al. 2014, MNRAS, 444, 2960 

Elliott, J., Greiner, J., Khochfar, S., Schady, P., Johnson, J. L., & Ran, A. 2012, A&A, 539, 
A113 

Elliott, J., Khochfar, S., Greiner, J., & Dalla Vecchia, C. 2015, MNRAS, 446, 4239 
Elliott, J. et al. 2013, A&A, 556, A23 
Ellis, R. S. et al. 2013, ApJ, 763, L7 

Feigelson, E. D. & Babn, G. 2012, Modern Statistical Methods for Astronomy (Gambridge 
University Press) 

Ferrero, P. et al. 2009, A&A, 497, 729 



Fruchter, A. S. et al. 2006, Nature, 441, 463 
Fryer, C. L. & Heger, A. 2005, ApJ, 623, 302 
Fynbo, J. P. U. et al. 2003, A&A, 406, L63 
Gehrels, N. et al. 2004, ApJ, 611, 1005 
Graham, J. F. & Fruchter, A. S. 2013, ApJ, 774, 119 
Greiner, J. et al. 2008, PASP, 120, 405 

— 2007, The Messenger, 130, 12 

— 2009, A&A, 498, 89 

— 2011, A&A, 526, A30 

— 2012, Experimental Astronomy, 34, 551 
Hjorth, J. et al. 2012, ApJ, 756, 187 

— 2003, Nature, 423, 847 

Ilbert, O. et al. 2006, A&A, 457, 841 

Jaacks, J., Ghoi, J.-H., Nagamine, K., Thompson, R., & Varghese, S. 2012, MNRAS, 420, 
1606 

Jaunsen, A. O. et al. 2003, A&A, 402, 125 

Kaplan, E. L. & Meier, P. 1958, J. Amer. Stat. Assn., 53, 457 

Katz, B., Sapir, N., & Waxman, E. 2012, ApJ, 747, 147 

Kistler, M. D., Yiiksel, H., Beacom, J. F., Hopkins, A. M., & Wyithe, J. S. B. 2009, ApJ, 
705, L104 

Kistler, M. D., Yiiksel, H., Beacom, J. F., & Stanek, K. Z. 2008, ApJ, 673, L119 
Kocevski, D., West, A. A., & Modjaz, M. 2009, ApJ, 702, 377 
Kriihler, T. et al. 2011, A&A, 534, A108 

— 2008, ApJ, 685, 376 

— 2015, ArXiv.org, 1505.06743 



- 30 - 


Kulkarni, S. R. et al. 1998, Nature, 393, 35 

Laskar, T., Berger, E., & Chary, R.-R. 2011, ApJ, 739, 1 

Levesque, E. M. 2014, PASP, 126, 1 

Levesque, E. M., Berger, E., Soderberg, A. M., & Chornock, R. 2011, ApJ, 739, 23 
Levesque, E. M., Kewley, L. J., Berger, E., & Zahid, H. J. 2010a, AJ, 140, 1557 
Levesque, E. M., Kewley, L. J., Graham, J. F., & Fruchter, A. S. 2010b, ApJ, 712, L26 
MacFadyen, A. 1. & Woosley, S. E. 1999, ApJ, 524, 262 
Madau, P. & Dickinson, M. 2014, ARA&A, 52, 415 
Maiolino, R. et al. 2008, A&A, 488, 463 

Mannucci, F., Salvaterra, R., & Campisi, M. A. 2011, MNRAS, 414, 1263 
Marsden, D. et al. 2014, MNRAS, 439, 1556 
McBreen, S. et al. 2010, A&A, 516, A71 

Meurer, G. R., Heckman, T. M., & Calzetti, D. 1999, ApJ, 521, 64 

Mocanu, L. M. et al. 2013, ApJ, 779, 61 

Modjaz, M. et al. 2008, AJ, 135, 1136 

Mortlock, D. J. et al. 2011, Nature, 474, 616 

Nakar, E. & Sari, R. 2012, ApJ, 747, 88 

Nardini, M. et al. 2011, A&A, 531, A39 

Oesch, P. A. et al. 2012, ApJ, 745, 110 

Ota, K. et al. 2010, ApJ, 722, 803 

Ouchi, M. et al. 2010, ApJ, 723, 869 

Paul, J., Wei, J., Basa, S., & Zhang, S.-N. 2011, Comptes Rendus Physique, 12, 298 
Perley, D. A. et al. 2009, AJ, 138, 1690 


2013, ApJ, 778, 128 



31 


— 2011, AJ, 141, 36 

Planck Collaboration et al. 2015, ArXiv.org, 1502.01589 

Prochaska, J. X., Chen, H.-W., Dessanges-Zavadsky, M., & Bloom, J. S. 2007, ApJ, 666, 
267 

R Core Team 2013, R: A Language and Environment for Statistical Computing, R Founda¬ 
tion for Statistical Computing, Vienna, Austria 

Robertson, B. E. & Ellis, R. S. 2012, ApJ, 744, 95 

Robertson, B. E., Ellis, R. S., Furlanetto, S. R., & Dunlop, J. S. 2015, ApJ, 802, L19 
Salvaterra, R. et al. 2012, ApJ, 749, 68 

— 2009, Nature, 461, 1258 

Savaglio, S., Glazebrook, K., & Le Borgne, D. 2009, ApJ, 691, 182 

Savaglio, S. et al. 2012, MNRAS, 420, 627 

Schady, P. et al. 2015, ArXiv.org, 1505.04415 

Schechter, P. 1976, ApJ, 203, 297 

Schenker, M. A. et al. 2013, ApJ, 768, 196 

Schlafly, E. F. & Finkbeiner, D. P. 2011, ApJ, 737, 103 

Schmidt, K. B. et al. 2014, ApJ, 786, 57 

Schulze, S. et al. 2015, ArXiv.org, 1503.04246 

Shimasaku, K. et al. 2006, PASJ, 58, 313 

Shull, J. M., Harness, A., Trenti, M., & Smith, B. D. 2012, ApJ, 747, 100 
Sparre, M. et al. 2014, ApJ, 785, 150 
Stanek, K. Z. et al. 2006, Acta Ast., 56, 333 

— 2003, ApJ, 591, L17 

Svennson, K. M. 2011, PhD thesis. University of Warwick 
Tanvir, N. R. et al. 2009, Nature, 461, 1254 



- 32 - 


— 2012, ApJ, 754, 46 

Therneau, T. 2012, A Package for Survival Analysis in S, R package version 2.36-14 
Th5ne, C. C. et al. 2013, MNRAS, 428, 3590 

Tody, D. 1993, in Astronomical Society of the Pacific Conference Series, Vol. 52, Astronom¬ 
ical Data Analysis Software and Systems II, ed. R. J. Hanisch, R. J. V. Brissenden, 
& J. Barnes, 173 

Trenti, M., Perna, R., & Jimenez, R. 2015, ApJ, 802, 103 

Trenti, M., Perna, R., Levesque, E. M., Shull, J. M., & Stocke, J. T. 2012, ApJ, 749, L38 
Trenti, M., Perna, R., & Tacchella, S. 2013, ApJ, 773, L22 

Trenti, M., Stiavelli, M., Bouwens, R. J., Oesch, P., Shull, J. M., Illingworth, G. D., Bradley, 
L. D., & Carollo, C. M. 2010, ApJ, 714, L202 

Tseliakhovich, D. & Hirata, C. 2010, Phys. Rev. D, 82, 083520 

Vieira, J. D. et al. 2010, ApJ, 719, 763 

Wanderman, D. & Piran, T. 2010, MNRAS, 406, 1944 

Wang, F. Y. & Dai, Z. G. 2014, ApJS, 213, 15 

Woosley, S. E. 1993, ApJ, 405, 273 

Woosley, S. E. & Heger, A. 2006, ApJ, 637, 914 

Xu, D. et al. 2013, GRB Coordinates Network, 14273, 1 

Yoon, S.-C., hanger, N., & Norman, C. 2006, A&A, 460, 199 

Yoshida, N., Sokasian, A., Hernquist, L., & Springel, V. 2003, ApJ, 591, LI 

Yiiksel, H., Kistler, M. D., Beacom, J. F., & Hopkins, A. M. 2008, ApJ, 683, L5 

Zheng, W. et al. 2012, Nature, 489, 406 


This preprint was prepared with the AAS lAIBX macros v5.2. 



Table 1. 


GRB Host Galaxy Observations 
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Note. — “ Redshifts with errors are photometric redshifts. ^ Upper limits are at the 2a confidence level. 
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Note. — (1) The third column contains measured magnitudes/fluxes, i.e. without any correction for Galactic foreground or host-intrinsic extinction. Observed 
magnitudes RUHR are in the Vega system, and r'i'z' magnitudes and HST measurements are in the AB system, except where noted. (2) Spitzer upper limits (Sa) 
from [Laskar et al.| | |20lT] are used only when no competitive limits are available at other wavelengths. (3) All upper limits are 2a confidence level; in some cases, 
reported 3 ct upper limits have been transformed into 2a limits by adding 0.44 mag (e.g. for measurements from |Hjorth et al.|2012] . Observed magnitudes are corrected 
for Galactic foreground extinction jSchlafly &: Finkbeiner|2011} before conversion to absolute magnitudes, using Ay = — V) and Ar{Ai) = 0.80(0.61) X Ay, 
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